Deeply-Supervised Nets

نویسندگان

Chen-Yu Lee

Saining Xie

Patrick W. Gallagher

Zhengyou Zhang

Zhuowen Tu

چکیده

We propose deeply-supervised nets (DSN), a method that simultaneously minimizes classification error and improves the directness and transparency of the hidden layer learning process. We focus our attention on three aspects of traditional convolutional-neuralnetwork-type (CNN-type) architectures: (1) transparency in the effect intermediate layers have on overall classification; (2) discriminativeness and robustness of learned features, especially in early layers; (3) training effectiveness in the face of “vanishing” gradients. To combat these issues, we introduce “companion” objective functions at each hidden layer, in addition to the overall objective function at the output layer (an integrated strategy distinct from layer-wise pretraining). We also analyze our algorithm using techniques extended from stochastic gradient methods. The advantages provided by our method are evident in our experimental results, showing state-of-the-art performance on MNIST, CIFAR-10, CIFAR-100, and SVHN.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Coupling Supervised and Unsupervised Techniques in Training Feed-Forward Nets

A popular approach to training feed-forward nets is to treat the problem of adaptation as a function approximation and to use curve fitting techniques. We discuss here the problems which the use of pure curve fitting techniques entail for the generalization capability and robustness of the net. These problems are in general inherently associated with the use of pure supervised learning techniqu...

متن کامل

A Comparative Evaluation of Deep Belief Nets in Semi-supervised Learning

In this report I studied the performance of deep belief nets (DBNs) on semisupervised learning problems, in which only a small proportion of data are labeled. First the performance between DBNs and support vector machines (SVMs) are compared to investigate the advantage of deep models over shallow ones. I also explored the use of DBNs as pre-training for SVMs and feed-forward nets (FFNs). The e...

متن کامل

Extended Simple Colored Petri Nets

We present a new class of Petri Nets, called Extended Simple Colored Petri Nets (ESCP-nets), which are essentially Simple Colored Petri Nets (SCP-nets) with three added features: first, there is a built-in type of real numbers; second, tokens can be forced to wait in places; third, an interface specifies how the Net can be externally supervised. Even if these added features also add complexity,...

متن کامل

Neural Networks with Java: neural net overview

What are neural nets? The biological model: The human brain The components of a neural net What they can and where they fail The learning process What means "learning" refering to neural nets? Supervised and unsupervised learning Forwardpropagation Backpropagation Selforganization

متن کامل

Supervised models for an OpenAI driving game

Using the Open AI framework, we will use supervised learning with support of linear models and deep neural nets, in order to try to achieve honorable score at playing a driving game without human intervention.

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1409.5185 شماره

صفحات -

تاریخ انتشار 2015

Deeply-Supervised Nets

نویسندگان

چکیده

منابع مشابه

Coupling Supervised and Unsupervised Techniques in Training Feed-Forward Nets

A Comparative Evaluation of Deep Belief Nets in Semi-supervised Learning

Extended Simple Colored Petri Nets

Neural Networks with Java: neural net overview

Supervised models for an OpenAI driving game

عنوان ژورنال:

اشتراک گذاری